Picture for Mu Xu

Mu Xu

ABot-OCR Technical Report

Add code
May 27, 2026
Viaarxiv icon

POINav: Benchmarking and Enhancing Final-Meters Arrival in Real-World Vision-Language Navigation

Add code
May 27, 2026
Viaarxiv icon

VeriTrip: A Verifiable Benchmark for Travel Planning Agents over Unstructured Web Corpora

Add code
May 27, 2026
Viaarxiv icon

ProSR: Process-Shaped Spatial Reasoning for Reliable Chain-of-Thought in VLMs

Add code
May 25, 2026
Viaarxiv icon

Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation

Add code
May 12, 2026
Viaarxiv icon

Why Users Go There: World Knowledge-Augmented Generative Next POI Recommendation

Add code
May 12, 2026
Viaarxiv icon

DeepSight: Long-Horizon World Modeling via Latent States Prediction for End-to-End Autonomous Driving

Add code
May 11, 2026
Viaarxiv icon

ALAM: Algebraically Consistent Latent Transitions for Vision-Language-Action Models

Add code
May 11, 2026
Viaarxiv icon

AsyncShield: A Plug-and-Play Edge Adapter for Asynchronous Cloud-based VLA Navigation

Add code
Apr 27, 2026
Viaarxiv icon

Explore Like Humans: Autonomous Exploration with Online SG-Memo Construction for Embodied Agents

Add code
Apr 21, 2026
Viaarxiv icon